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Abstract —Optimization of a point-to-point (p2p) multiple- 
input single-output (MISO) communication system is considered 
when both the transmitter (TX) and the receiver (RX) have 
energy harvesting (EH) capabilities. The RX is interested in 
feeding back the channel state information (CSI) to the TX to 
help improve the transmission rate. The objective is to maximize 
the throughput by a deadline, subject to the EH constraints 
at the TX and the RX. The throughput metric considered is 
an upper bound on the ergodic rate of the MISO channel 
with beamforming and limited feedback. Feedback bit allocation 
and transmission policies that maximize the upper bound on 
the ergodic rate are obtained. Tools from majorization theory 
are used to simplify the formulated optimization problems. 
Optimal policies obtained for the modified problem outperform 
the naive scheme in which no Intelligent management of energy 
is performed. 

Index Terms —Energy harvesting, Limited feedback, MISO, 
Offline optimization. 

I. Introduction 

Powering up terminals in communication networks by 
renewable ambient energy reduces the carbon footprint of 
the information and communication technologies, which can 
no longer be neglected with the exponential growth in the 
number of communication devices. Another advantage of EH 
technology is that, it increases the autonomy of battery-run 
communication devices. In traditional wireless networks nodes 
get their energy from the power grid by always or periodically 
connecting to it. While it is easy to connect the terminals to the 
grid in some networks, in others, such as sensor networks, it 
cannot be done once after the deployment. Therefore, in such 
networks a node’s lifetime, and hence, the network lifetime, 
is constrained by the limited initial energy in the battery. 
Providing EH capabilities to the communication nodes is an 
attractive solution to the network lifetime problem |[T|. An 
EH node can scavenge energy from the environment (typical 
sources are solar, wind, vibration, thermal, etc.) la. With EH 
nodes in the network, in principle, one can guarantee perpetual 
lifetime without the need of replacing batteries. 

However, EH poses a new design challenge as the energy 
sources are typically sporadic and random. The main challenge 
lies in ensuring the Quality of Service (QoS) constraints of the 
network given the random and time varying energy sources. 
This calls for the intelligent management of various parameters 
involved in a communication system. 

Recently, a significant number of papers have appeared 
studying the optimal transmission schemes for EH commu¬ 


nication systems under different assumptions regarding the 
node’s knowledge about the underlying EH process. Offline 
optimization framework deals with systems in which non- 
causal knowledge of the EH process is available. Within this 
frame work, optimal transmission schemes are studied for the 
p2p fading channel O, broadcast channel lH, ||5], 0 and 
relay channel lEl, El. See |j9l for an extensive overview. 

To the best of our knowledge, a common aspect of all 
prior works on EH communication networks is that the TX 
is assumed to have access to perfect CSI. Knowledge of 
the CSI at the TX is beneficial in designing the optimal 
channel adaptation techniques and the TX Alters in multi¬ 
antenna systems. However, recent studies have demonstrated 
that, although feedback enhances the system performance, 
feedback resources, namely power and bandwidth, are limited, 
and must be spent wisely GO). As a result, an important 
question arises: How do the EH constraints affect the design 
of feedback enabled wireless networks? 

In this paper, we study the optimization of a feedback 
enabled EH MISO channel, where feedback is used to improve 
the rate through array gain. The system model and the main 
assumptions in this paper are given in Section III. In Section 
IV, we consider the optimization of the feedback policy under 
EH constraints at the RX, while the TX is assumed to have 
a constant power supply. The motivation is to address the 
following: In the case of EH, the available energy at the RX 
varies over time. Should the RX feedback same quality of CSI 
at all times? If so, can the CSI feedback quality be improved 
by using more bandwidth in the low energy scenario? In the 
second part of this paper (Section V), we assume that both the 
TX and the RX harvest energy. In this case, the transmission 
power policy and the feedback policy are coupled, and need 
to be jointly optimized. Results from multivariate majorization 
theory are used to devise simple algorithms. We start by 
giving a brief preliminary description of majorization theory 
in Section II. Numerical results are presented in Section VI to 
validate the analysis. Einally, Section VII concludes the paper. 

Notation: Boldface letters are used to denote matrices and 
vectors. The transpose and conjugate transpose of matrix A is 
denoted by and A^, respectively. We use dij to denote 
the element at the i-th row and j-th column of matrix D, and 
|§| to denote the cardinality of the set §. The set of integers 
from m to n, m < n, is represented by [m : n]. The algorithm 
with name “Algo” is represented as [output arguments]= Algo 
(input arguments). A circularly-symmetric complex Gaussian 
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distributed random variable rj with zero mean and variance cr^ 
is denoted by 77 ~ C3Nf(0,cr^). 

II. PRELIMINARIES 

In this section, the basic notion of majorization is introduced 
and some important inequalities on convex functions that are 
used in this work are stated. The readers are referred to 
HD, na for a complete reference. We start by stating the 
Edmundson-Madansky’s inequality. 

Theorem 1: HD If / is a convex function and cc is a random 
variable with values in an interval [a, &], then 

E[/(x)]<^/(a) + ^/(5), 

0 — a 0 — a 

where p, is the mean of x. 

Majorization theory formalizes the notion that the compo¬ 
nents of a vector x are “less spread out” than the components 
of a vector y. 

Definition 1: Let a; = [xi,..., a:„], y = [yi,. •., y„], 
x,y € R." and let a;(i) denote the i-th largest component of 

X. Then x is said to be majorized by y, denoted by a; A y, if 

i i 

n n 

i=l 

Definition 2: m 2.A . 1 ] An nxn matrix D with elements 
dij is doubly stochastic if 

dtj>0, yi,j €[1 :n], 

n n 

dij = 1 , Vj G [1 : n] and dij = 1, V* G [1 : n] . 

i=i i=i 

Theorem 2: IfT^ 4.A.1, 4.B.1] Eor a;, y G R", the following 
conditions are equivalent: 

• x<y. 

• X = yD for some doubly stochastic matrix D. 

• Eor all continuous concave functions y : R —>■ R, 

Er=i5(^0>Er=i5(y0- 

Definition 3: m 15.A.2] Let X and Y be m x n real 
matrices. Then X is said to be majorized by Y, written X A 

Y, if X = YD, where the nxn matrix D is doubly stochastic. 
Theorem 3: HU 15.A.4] Let X and Y be m x n real 

matrices. Then, X A Y if and only if 

n n 

E5(a^D>E5(yr), 

i=l i=l 

for all continuous concave functions g : R™ —R; here x’j and 
yf denote the Ath column vector of X and Y, respectively. 

III. System model 

We consider a p2p MISO fading channel as shown in Eig. [D 
where both the TX and the RX harvest energy from the 
environment. Each node is equipped with an individual energy 
buffer, i.e., a rechargeable battery, that can store the locally 
harvested energy. 
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Figure 1. MISO channel with feedback, where both the TX and the RX 
harvest and store ambient energy. 
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Figure 2. Energy harvesting time frame structure. 

A. Energy Harvesting Model 

The total observation time is divided into K equal length EH 
intervals. At the beginning of the Ic-th EH interval, fc G [1 : AT], 
energy packets of size e\. , units arrive at the TX and the 
RX, respectively. At each node, this energy is first stored in an 
infinite size energy buffer, and used only for communication 
purposes, i.e., TX sending data, and the RX feeding back the 
CSI. We assume that all e\, e^’s are known in advance by both 
terminals. This model is suitable for an EH system in which 
the time-varying harvested energy can be accurately predicted 

m. 

B. Communication System Model 

Each EH interval consists of L data frames, each of length 
T channel uses. We assume a block fading channel model. The 
channel is constant during T channel uses of each frame, but 
changes in an independent and identically distributed (i.i.d.) 
fashion from one frame to another. The time frame structure is 
shown in Eig. ID The TX has M > 1 antennas, while the RX 
has a single antenna. The received signal in a given channel 
use is given by 

y = hb^ws + ry, ( 1 ) 

where h G represents the vector of channel coefficients 

from TX to the RX with i.i.d. 624(0,1) elements, w G 
denotes the beamforming vector, the input symbol maximizing 
the achievable ergodic rate in the /c-th EH interval is s ^ 
CN(0,pfe), and rj ^ C24(0,1) represents the noise at the RX. 

C. Feedback Model 

We assume that the RX perfectly estimates the channel 
state at the beginning of each data frame, and feeds back 
the quantized CSI to the TX within the same frame. In the 
k-th EH interval, the frame structure is as follows: The RX 
in Tfe channel uses sends the CSI through a feedback channel 
(uplink) which is modeled as an additive white Gaussian noise 
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(AWGN) channel. In the remaining T — Tk channel uses, 
TX sends data to the RX (downlink) exploiting the obtained 
CSI. The feedback model represents the Time-Division Duplex 
(TDD) system in which uplink and downlink use the same 
band in a time-sharing fashion, but the communication de¬ 
vices are not self-calibrated, and hence, induce non-reciprocal 
effects lfT3ll . lfT4l . In the above model, although the feedback 
overhead incurs a cost in the downlink bandwidth, a similar 
trade-off in the resource allocation between the CSI feedback 
quality and uplink data rate also arise in a Frequency-Division 
Duplex (FDD) system m. Hence, the analytical results 
obtained in this paper are applicable in general settings, and 
for instance, can be used to address the trade-off between CSI 
quality and effective data rate in an FDD system. 

In the A:-th EH interval, quantization of the channel state 
is performed using a codebook Ck known at both the TX 
and RX. The receiver uses Random Vector Quantization 
(RVQ). The codebook consists of M-dimensional unit vectors 
Ck = where bk is the number of bits used 

for quantization. The RX chooses the beamforming vector 

according to Wk = arg max \h^f\ , where h = tt^. 
feCk 

We assume that the length of the EH interval is very large 
compared to the channel coherence time (i.e., L is very large). 
As a result, the achievable ergodic rate in the fc-th EH interval 
is given by 



log2 


1 + 


Pk 

(i-¥) 



2 

where Uk = \h^Wk\ ■ Note that and ||fi|p are independent 
E). By using the AWGN feedback channel model, the 
number of feedback bits bk can be related to the energy used 
by the RX, qk, and the number of channel uses Tk as follows: 


h 


Tk l0g2 



(3) 


where is the noise variance in the uplink. Eor analytical 
tractability, we neglect the practical constraint that bk should 
be an integer. Using the ergodic rate expression given in ifTSl 
Equation (27)] and (O, the ergodic rate Rk = R{pk,qk,Tk) 
is found to be 


( M-l 

^ (pk) - 

1=0 

I (S) 

Uk=0 / 

(4) 

where pk = , and En{x) = 

is the n-th order exponential integral. 


D. Optimization Problem 

The problem of maximizing the sum throughput by the end 
of the K-th EH interval can be formulated as 

K 

max Rk (5 a) 

Pk,<lk,Tk 

k—\ 

I I 

s.t. V/e[l:iF], (5b) 

I I 

LT W G [1 : a:], (5c) 

i—1 i—1 

Tk G [0,r), Pk > 0, and qk > 0,VA: G [1 : K], 

(5d) 

The constraints ( l5bb and © guarantee the energy neutrality 
of the system, i.e., at each node, energy consumed can not be 
more than the energy harvested till that time. Also note that 
Tk impacts the achievable rate Rk in each EH interval. 

Coming up with simple algorithms to solve the optimization 
problem is desirable in EH networks as the nodes may not have 
the computational and energy resources for running complex 
optimization algorithms. However, the ergodic rate expression 
used in the above optimization problem is not in closed form 
and offers little insight into the convexity of the problem 
which is required to reduce the complexity of optimization. 
This motivates the use of convex bounds on @ as the 
objective function in the following optimization problems. 
Solving these modified problems provides an upper bound on 
the throughput. Since the constraints in the original and the 
modified optimization problems are the same, the solution for 
the modified problem is also feasible in the original problem, 
and if used in evaluating the exact rate expression in (|4|i, we 
obtain a lower bound on the throughput. In some settings, we 
show that the bounds used are very close to the ergodic rate. 

Before tackling the above problem, first, we consider a 
special case in which only the RX harvests energy. Later, the 
general case with both the TX and the RX harvesting energy 
is studied. 


IV. EH Receiver 

In this setting, the RX harvests energy from the environ¬ 
ment, whereas the TX is connected to the power grid so that 
it has a fixed power supply at all times. Therefore, there are 
no EH constraints at the TX, and constraints (l5^ can be 
ignored. However, there is now a constraint on the average 
transmission power at each data frame of the fc-th EH interval 
i-e., Pk < P, Vfc. The expected value Uk is given by Qsi, im 

= (6) 

where /3 {x, y) denotes the beta function. Using the quantiza¬ 
tion error bound in iflhl Lemma 6], (|6l) can be bounded a^ 

EK] < 1^1 = 1 - 2^- (7) 

^This bound is universal in the sense that it applies to any fe^-bit quanti¬ 
zation of an isotropically distributed vector, not necessaiily limited to RVQ. 









4 


Applying Jensen’s inequality on (|2]i, substituting Q and Q, 
and using the fact that E ||h,|p = M, an upper bound on the 
ergodic rate (p^, qk,Tk) is obtained as 


Rk = ik logs 


1 + 


PkM 


1 - 


M -1 


M 


1 + 


Qk 

Tkcr"^ 


M-1 


( 8 ) 

where tk = — ^)- 

We now illustrate the tightness of the upper bound. Ap¬ 
plying the Jensen’s inequality on (|2]l, — Rk can be lower 

bounded as 


Rl-Rk> tk logs ( 1 + ) - 


tk E||h.|p logs ( 1 + 


tk 


' R-ivk] 


(9) 


Since (|2]) is a concave function of Vk and i^k S [0,1], applying 
Theorem [T] on (|2]l, we have 


Rk > tk E||h ||2 logs ll^ll^j E[z^fe] 

Now using (fTOl) . — Rk can be upper bounded as 

Rk-Rk< tk logs (^1 + ) - 

tk E||b ||2 logs E[r'fc] 


( 10 ) 


( 11 ) 


= 1 and limf,j^^oo E[z/fc] = 1 QS], 


Since both lim, 
and using @ and (fTTT i. we have. 


ARk= lim = 4E||^I|2 logs 

bfe-s-oo 


tk + PkM 

tk+Pk ii/iir 


( 12 ) 


Further, for all feasible Tk, in the low power regime. 


lim ARk = 0, (13) 

Pfc->0 

and in the high power regime, 

lim ARk = tk (logs M - E||h ||2 logs ll^lP) 

Pk^oo 

< logs M-E||h ||2 logs ||/r|p. 

From the above analysis, it can be seen that when the RX has 
enough harvested energy to send large number of feedback 
bits, in the low power regime the bound is tight, and in the 
high power regime the difference is bounded by a constant. 
For example, it is 0.1958 for M = A, and also note that 
limM^-oo logs M - E ||^||2 logs II^IP = 0. 

Using © as the objective function, the modified optimiza¬ 
tion problem can be written as follows. 


max 

Pk,qk,rk 


s.t. 


K 

U = ^i? 

k^l 


u 

k 


I I 

2—1 2 = 1 

Pk < P, and pk >0, Vfc e [1 : K], 
Tk€[0,T), and qk>0, Vk e [1 : K], 


(15a) 

(15b) 


where p is the power constraint at the transmitter. 

As the objective function is monotonic in qk and pk, 
the constraint in ( I15bl i must be satisfied with equality for 
I = K, and the hrst constraint in ( I15cl l must be satished with 
equality, i.e., pk = p, Vfc; otherwise, we can always increase 
qK,Pk, and hence, the objective function, without violating 
any constraints. Now it remains to optimize over the variables 
qk and Tk. 

The feasible set is represented as 

i?= ^Q.TlQk.Tk satisfy ( |15b| l, ([TSdll Vfc} , (16) 

where q = [gi,..., qx] and r = [n,..., tk]- To show that 
the above problem is a convex optimization problem, we make 
use of the following lemma. 

Lemma 1: If the function f {x,t) : R+ is concave, 

and g {y, z) : —5> R+ is concave and monotonically 

increasing in each argument, then the function h {x, y, t) = 
( 1-^)5 (li’ 4^) concave V(a;,y) e R\,t € [0,T). 

Proof: The proof is similar to that of showing the 
perspective of a concave function is concave. See Appendix. 

■ 

Proposition 1: The objective function of the optimization 
problem (fTsT i is concave. 

Proof: See Appendix. ■ 

Since the objective function in (fTSl) is concave and the 
constraints are linear, it has a unique maximizer El . Using the 
concavity of the objective function, we show that the optimal 
energy allocation vector is the most majorized feasible energy 
vector. 

Proposition 2: The global optimum of (fTST i is obtained at 
{q*,T*), where q* A q,y{q,T) € jj, and is the solution 
of the following equation 

(17) 

Proof: Consider the following equivalent form of ( fTSl ). 
where the optimization is performed in two steps. 

max'U(q) s.t. V (q, r) G jj, (18) 

Q 

where IX (q) is obtained by 

U (q) = max 11 (q, x) s.t. V (q, x) G jj. (19) 

T 

Since IX is a concave function over the convex set the 
function IX (q) is concave, where the domain of IX is the set 
5 = {qI (Qi t) G )?} El 3.2.5]. IX = Y^k=i Rk E continuous, 
differentiable and concave in Tk G [0,r). Furthermore, for 
given qk, Rk approaches log 2 (1 + p) and 0, as Tk approaches 
0 and T, respectively. Therefore, the unique maximizer of ( fT9b 
lies in [0,T), and it is obtained at 

f)'] / f) 

^k=^k=0,V.G[l:JT]. (20) 

From above, as is only a function of 

K 

U{q)^Y.^k 

k=l 


(15c) 

(15d) 


( 21 ) 
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Figure 3. Optimal number of channel uses for sending feedback. 


where Rl IT-[q^) = {qk,T^ (qk))- Using dH) and 
Theorem |2] U{q*) > U{q),\/q G Finding the optimal 
energy allocation vector q* under the EH constraints turns out 
be a well known problem, and the algorithm to construct q* is 
given in various works GSi-iaoi. The proof that the algorithm 
constructs the most majorized feasible energy vector is given 
in 1201 . Since the optimal energy allocation vector is q*, the 
optimal T* is obtained by (fTTI l. ■ 

A brief description of the algorithm tailored to this work is 
given next, while the details can be found in lT8ll - i20ll . There 
is no closed form expression for the solution of ([TtI) . hence 
we resort to numerical methods to obtain r*. Fig. |3] shows the 
behavior of as a function of the allocated energy q^. 


A. Optimal Energy Allocation 

From Definition [T] we can see that the components of 
the most majorized energy vector are "less spread out" than 
any other feasible energy vector. Therefore, the algorithm 
essentially tries to make the energy vector as equalized as 
possible. This is done by spreading the energy to future 
intervals. However, note that the energy arriving in later 
intervals cannot be spread to earlier intervals due to the EH 
constraints. The Optimal Energy Allocation (OEA) algorithm, 
given in Algorithm [1] divides the EH intervals into |S| energy 
bands whose indices form the set § = {Bq, Si,. •. i?|s| 
where Bi < < j, Bq = 0, and S|s| = K. 

The i-th energy band contains the EH intervals with indices 
k G [Bi-i + 1 : Bi], Moreover, the optimal allocated energy 
values in each EH interval belonging to the i-th energy band 
are equal, and denoted by < 7 ^^^. The energy vector q* obtained 
by [9*,§r] = OEA(iT, {e^/L}), has the following properties: 




(PI) = Qh = > Vfc G [S,_1 + 1 : S,]. 

(P2) The entries are strictly monotonic, i.e., q*^^ < q*^^ < 
< ^dsi)- 


Input : EH intervals K\ Harvested energy {ci} 
Output: Energy allocation o*. Energy band indices 
§ = {Sq, Bi, ... S|s|} 

// initialization 
Bo := 0 ; 

for i = 1 : K do 

for fc = a: : -1 : (S,_i + 1) do 

(i) ’ ZG{S,_i + l,...,fc} 

if Ei=i o* < Ei=i ez, I = 1, ■■■, K then 
Bi = k\ 

Save ,ol] 

break; 

end 

end 

if Bi == K then 
I break; 
end 

end 

Algorithm 1: Optimal Energy Allocation (OEA) algorithm 


to send feedback in this interval. Therefore, without loss of 
optimality we only consider EH profiles where e\ > 0. Other¬ 
wise, if there is an EH profile such that = 0, fc G [1 : m— 1], 
then pk — 0, k € [1 : m — 1] due to the constraints in (|5c|. 
In these intervals the RX simply accumulates the harvested 
energy, and without loss of optimality we can have a new 
EH profile with e{ = G [1 : K — m + 1], and 

ei' = ELi Sfc and G [2 : AT - m -f 1] for 

further analysis. 

The ergodic rate upper bound in (O is not concave, but 
concave in each variable given the other variables are fixed. 
To obtain a simple algorithm and an upper bound on the 
throughput, we follow a similar approach as in the previous 
section, and use a concave upper bound on ® as the objective 
function for throughput optimization. 

This bound is obtained by using a hypothetical system in 
which the transmission power is 1 watt higher than the actual 
transmission power of the system, which is Pk/tk- Plugging 
this into the upper bound in ®, a new upper bound = 
{pk, qk,Tk) on the ergodic rate is obtained: 

= 4log2(l+(^l + ^)^), (22) 

-•^fc 

where tk = I - ^ and fk = M - {M - 1) (l + ' 

We now illustrate the tightness of the upper bound in (1221 in 
the low and high power regimes. For all feasible Tk,pk and 
qk, we can see that 0 < ffc < 1 and 1 < /fc < M. Consider 


V. EH Transmitter and Receiver 

In this section, we consider the general case where both the 
TX and the RX harvest energy. Note that if the TX is silent in 
the fc-th interval, i.e., pk = 0, there is no incentive for the RX 


R 


ub 

k 


-Rl 


tk l 0 g 2 


/ + tkfk + Pkfk \ 

V tk+Pkfk J 


tk l 0 g 2 (tk) 

(23) 


Note that ( |2^ is decreasing in pk for fixed and g^,. Since 
Tk,fk are bounded, for fixed Tk and qk, in the low power 
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regime 

\im Rf-R-,=tklogJl + ^) 

\ tk J (24) 

< log2 (1 + M), 

and in the high power regime, 

lim Rf -Rk= -tk log2(4) < 0.5. (25) 

From the above analysis, it can be seen that, (l2?b decreases as 
the power is increased, and it is bounded by a constant in the 
high power regime. By using (|2^ . the modified throughput 
maximization problem is formulated as 


K 


max Ui = 

Pk,qk,rk ^^ 

k—1 

7 7 

(26a) 

L L 

s.t. <^e[,VZG[l:iT], 

2 = 1 2=1 

7 7 

(26b) 

L L 

<^e‘,WG [1 :iT], 

(26c) 


i—1 2=1 


Tk e [0,T), Pk >0, qk> 0, and Vfc e [1 : K]. 

(26d) 

Since the objective function is monotonic in qk and pk, the 
constraints in (I26bb and (I26cl i must be satisfied with equality 
for I — K, otherwise, we can always increase qK,PK, and 
hence the objective function, without violating any constraints. 
The feasible set is represented as 

5 = {(P, q, t) \pk, qk, Tk satisfy ( |26bb . ( |26cb and (|26dl l Vfc} , 


where p = [pi,...,pk], q = [qi,...,qK] and r = 
[ti,...,tk]- 

Proposition 3: The objective function in the optimization 
problem (l26l l is concave. 

Proof: See Appendix. ■ 

Since the objective function in (l26l l is concave and the con¬ 
straints are linear, it has a unique maximizer QT). Consider 
the following equivalent form of ( l26l l. where the optimization 
is performed in two steps. 

max 111 (p, q) s.t. V (p, q, r) e J, (27) 

p.q 

where Hi (p, q) is obtained by 

lii (p, q) = max Ui (p, q, t) s.t. V (p, q, t) G Z- (28) 

T 

Since Hi is a concave function over the convex set Z, the func¬ 
tion Hi is concave with domain j) = {(p, q) \ (p, q, r) G J} 
In] 3.2 .5]. Hi = continuous, differentiable and 

concave in Tk G [0, T). Furthermore, for given pk and qk, Rf^ 
approaches log 2 {2+Pk) and 0, as Tk approaches 0 and T, 
respectively. Therefore, the unique maximizer of (l28T l. , Vfc 

lies in [0,T), and it is obtained as 

I A* = ^ I A* = 0’ Vfc G [1 : iT] . (29) 

dTk ” dTk 

As T^ is only a function of qk and pk, (l27l l can be written as 


max 

Pk,qk 


K 

Hi=^i?f s.t. Vfc,(pfc,qfc) GJ, 


where Rf = R^^ {pk, qk) = (Pk, iPk,qk))- 

In order to get an insight on how the optimal solution of 
(l27l l may look like, consider a simple scenario in which there 
is only a sum power constraint at the TX and the RX, i.e., the 
constraints in (I26bb . (I26cb has to be satisfied for only I = K. In 
this case, by Jensen’s inequality, the uniform power allocation 
at the TX and the RX is optimaj^. However, due to the EH 
constraints, this may not be feasible. Using this intuition, we 
can see that the optimal policy tries to equalize the powers as 
much as possible, while satisfying the EH constraints. Next, 
we consider the case in which the EH profiles at the TX and 
the RX are similar, and show that the optimization problem is 
considerably simplified. 


A. Similar EPl Profiles 

The EH profiles are similar in the sense that the most 
majorized feasible vectors obtained from the EH profiles of 
the TX and RX, p* and q*, have the same structure, i.e., if 
p* = Ci,Vj G [m : n], then q* = C 2 ,V* £ [m : n] for some 
constants ci,C 2 > 0. We now give a formal definition. 

Definition 4: By using the OEA algorithm, let [q*,§r] = 
OEA(A:,{e[/L}) and [p*,§t] = OEA{K, {el/LT}). EH 
profiles at the TX and the RX are said to be similar if §r = Sit- 

Erom Section ini we can see that the definition of majoriza- 
tion for the vector case does not directly extend to the matrix 
case. If OEA algorithm is used at the TX and RX separately, 
we get the most individually majorized power vectors, which 
in general may not be the optimal solution of (l27l) . However, 
we now show that if the EH profiles are similar, the above 
mentioned approach is indeed optimal. 

Proposition 4: If the EH profiles at the TX and the RX are 
similar then {q*,p*, t*) is the global optimum of (l26b . where 
q* d: q,P* d: P, V {q,p,T) G jj, and is the solution of 



0, Vfc G [1 : iT]. 


(31) 


Proof: See Appendix. 


B. Different EPl Profiles 

Unfortunately, we could not find a simple algorithm to solve 
(l26b in a general setting where the EH profiles are not similar. 
In (l30b . if one variable is fixed, optimizing over the other vari¬ 
able has a directional or staircase water-filling interpretation 
El, ESI, however, the difficulty lies in the fact that there is 
no closed form expression for R%^ ■ Nonetheless, based on the 
convexity of the objective function, some properties of the 
optimal solution are given below. 

Lemma 2: Under the optimal policy, the transmission power 
Pk, and the energy used to send the feedback qk are non¬ 
decreasing in k,\/k G [1 : K], 

Lemma 3: Under the optimal policy, at the time instants at 
which changes, the energy buffer of either the TX or the 
RX is emptied. 

The proofs of the above lemmas are given in Appendix. 

^ In this section, with slight abuse of terminology we use the terms RX 
power and RX energy interchangeably. 


(30) 
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Figure 4. Model for a solar energy harvesting profile. 

VI. Numerical Results 

We start by considering the case in which the RX harvests 
energy, while the TX has a constant power supply. We assume 
that the RX is equipped with a solar EH device. Following 
ED, solar irradiance data is taken from the database reported 
in lf22l . Each EH interval is of duration A = 1 hour, T = 200 
ms, resulting in L = 18000 frames. The harvested power from 
the irradiance data can be calculated as, pharv = /[Watt/m^] x 
Area\m?] x p, where p is the efficiency of the harvester. A 
hypothetical solar panel of variable area is assumed. The area 
of the panel is adjusted such that we have the EH profile shown 
in Fig. |4] at the RX. In Fig. |4l the harvested power to noise 
ratio (HPN) in each EH interval is shown. 

Using this EH profile, throughput of different feedback 
policies is shown in Fig. |5] In Fig. |5] OEA represents the 
proposed policy in which the energy vector is obtained by 
using the OEA algorithm, and then the optimal time span 
of feedback is obtained by solving (l20l i. In the greedy 
scheme, the consumed energy is equal to the harvested energy 
in that interval, i.e., qk = e^/L, and then optimization is 
performed only over Tk, given qk- The performance of the 
above policies when the feedback bits are rounded to the 
largest previous integer is also shown. We can see that the 
proposed approach outperforms the greedy policy by 1.6 dB 
at a rate of 4 bits/s/Hz. Also the rate loss due to bit rounding is 
negligible. In Fig. |6] feedback bit allocation is shown for the 
above mentioned policies for a downlink SNR of 10 dB. From 
Fig. |6] we can see that with the proposed strategy, feedback 
bit allocation is equalized as much as possible. 

We now consider the case in which both the TX and the 
RX harvest energy, with similar EH profiles. The same EH 
profile in Fig. |4]is separately used at both the RX and the TX, 
hence the EH profiles are similar. In Fig. |7] the throughput 
of different schemes is shown at various mean HPN values at 
the TX. The mean HPN at the TX is varied by increasing the 
harvester area at the TX, i.e., the EH profile is multiplied by 
a positive number (area), while keeping the same shape and 
efficiency. In Fig. OEA represents the proposed policy in 
which the energy vector at the TX and the RX is obtained by 
using the OEA algorithm, and then the optimal time span of 
feedback is obtained by solving ( |29] |. In the greedy scheme. 



Figure 5. Ergodic rate with only an EH RX, and M = 4. 



Figure 6. Feedback load at downlink SNR of 10 dB, M = A. 

the allocated energy is equal to the harvested energy in that 
interval, i.e., at the TX pk = e\/LT, at the RX qk = e^/T, 
and then optimization is performed only over Tk, given pk 
and qk ■ The difference in throughput between the greedy and 
OEA is small when the average HPN is low, and it increases 
with the HPN. In contrast to the OEA scheme, using the 
greedy approach with the solar EH profile results in some EH 
intervals being allocated zero energy, and therefore does not 
scale by increasing the harvester area. This particularly hurts 
the greedy policy’s throughput in the high HPN regime as the 
multiplexing gain (pre-log factor) is reduced. 

Finally, we consider a case with non-similar EH profiles, 
where the EH profiles are generated independently at the 
TX and the RX, and they are i.i.d. with exponential distri¬ 
bution. EH profiles are verified not to be similar according 
to Definition |4] Similarly to Fig. |2l in Fig. the mean 
HPN at the TX is varied by multiplying the EH profile by 
a constant, while keeping the same shape. Since we could 
not find a simple algorithm in this case, CVX solver is used 
to solve the optimization problem El, and is denoted as 
CVX in Fig. [8] As we can see, the heuristic of using the 
OEA approach performs quite well even in the non-similar 
EH profile scenario. The energy allocation at the TX and the 










































Energy allocation at the RX 




Energy allocation at the TX 



Figure 7. Ergodic rate for similar EH profiles, M = 4. 


Figure 9. Energy allocation at the TX and the RX, M = 4. 



Figure 8. Ergodic rate for non-similai' EH profiles, M = 4. 


RX are shown in Fig. |9] for the above mentioned policies at 
an average per frame HPN of 0.5 dB at the TX. Different 
from Fig. 0 in Fig. [ 8 ] the rate scaling with average HPNs is 
same for both the greedy and the OEA policies. For the greedy 
policy, the allocated energy in an EH interval scales with the 
increasing mean HPN, in contrast to the solar EH profile, for 
which the allocated energy is zero in some intervals. 

VH. Conclusion 

In contrast to the existing literature on the design of energy 
harvesting communication systems, we have assumed in this 
paper that the perfect channel state information is available 
only at the receiver side; and we have studied the problem 
of CSI feedback design in a p2p MISO channel under EH 
constraints at both the TX and the RX. Since the exact expres¬ 
sions of throughput are complicated, concave upper bounds 
have been used in the optimization problems. We have first 
considered the case in which only the RX harvests energy, and 
optimized the feedback policy under EH constraints. Later, the 
general case, in which both the TX and the RX harvest energy, 
is analyzed. We have shown that, if EH prohles are similar, 
the optimization problem can be considerably simplihed. We 


remark that the result obtained in Proposition|4]is general, and, 
for example, it can be used in a network setting in which a 
concave utility is to be maximized in the presence of EH nodes 
with similar harvesting profiles and infinite size energy buffers. 
Numerical results show that the proposed policies not only 
outperform the greedy policy, but also achieve performances 
very close to the theoretical upper bound. Our work sheds 
light on the design of feedback-enabled multi-antenna systems 
when the nodes depend on EH devices for their energy. 


Appendix 


A. Proof of Lemma Q] 

Let Xi = [xi yi ,X 2 = [x 2 y 2 ^ 2 ]"^, we have 
h (AXi -f (1 — A) X 2 ) 

Xyi + {l-X)y2 f {x,t)\ 


= 0 <?( 

(a) 

> ©5 


= 05 


0 ’ ® / 

^ Xyi + (1 - A) 52 A/i + (1 - A )/2 
^ 0 ’ 0 
012/1 , 022/2 0l/l , 02/2 


0ai Qa2 ’ 0Q!i 


002 


(32) 


V Oti Qi 


a2 Oi2 


= Xh (Xi) + (l-A)h(X 2 ), 


where x = Axi -I- (1 — A) X 2 , t = Afi -I- (1 — A) t 2 , 
fi = fixifi), /2 = fix 2 ,t 2 ), 01 = A(l-^) and 
02 4 (1 _ A) (1 - , 0 = 01 + 02, ai = (1 - ^), 

a 2 = (1 — Here 

(a) follows from the fact that / {x, t) is concave, and 5 ( 5 , z) 
is monotonically increasing in each argument, 

(b) follows from the fact that ^ -f ^ = 1 , and 5 ( 5 , z) is 
concave. 


B. Proof of Proposition Q] 

Reproducing the ergodic rate bound in ® with pk = P, Vfc, 
we have 


R'" {qk,Tk) 


tk log2 1 + 


PhX 

tk ) 


(33) 
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Where 4 4 A (^ - (M - 1)(1 + ^)^). 

Since b^. in @ is concave in qf. and Tk, it can be easily seen 

— { \ -W-^I 

that 2 “= (1 + t® convex, and hence, is 

concave. Using Lemma[T]with g {y, z) = log 2 (1 + z) and f^, 
we can see that is concave. Since the objective function 
in O is the summation of it is also concave. 


C. Proof of Proposition |5] 

First, we show that g {y, z) = log 2 (1 + (1 + y) z ), (y, z) G 
is concave for y > 0, z > 1. The Hessian of g is given by 


J 



-i^ + yf 


(34) 


where /3 = logg 2 (1 + (1 + y) z) > 0. Consider = 

— ^ ^a^z^ + 6 ^ (1 + y)^ — 2 a 6 ^, where u = [a b]^ G 

R^. It can be easily seen that < 0 for ab < 

0. For ab > 0, since z(l + y) > 1, u^Ju = 

— ^ (az — b{l + y))^ + 2ab (z (1 + y) — 1) < 0. As Hes¬ 

sian is negative semidefinite, g (y, z) is concave. Reproducing 
the ergodic rate bound in (l22li . we have 


i?r = 4iog2(i + (i + -^i^ 


Pk\ fk 


(35) 


where tk and fk are as defined before. 

By following the similar steps in Proposition [T] fk can be 
shown to be concave. Using Lemma [T] with g (y, z) and fk, 
we can see that Rf’^ is concave. Since the objective function 
in (l26l) is the summation of it is also concave. 


D. Proof of Proposition 0 

First, (p*, q*) is shown to be the solution of dSOl l and then 
T* is obtained by OTT i. Before solving (l30l l. we prove that 


K 

{P*, q*) =arg max ^g{pk,qk) 

9,Pk:qk (3o) 

s.t. Vfc, {Pk,qk) G 3,y G C, 

where € is the set of all continuous concave functions. As (l30l l 
is a special case of (l36l l. {p*, q*) is also the solution of ( l30l l. 

Before starting, we note that the notations and properties 
of the OEA algorithm discussed in Section IIV-AI are used 
throughout the proof. By contradiction, let us assume that there 
exists a [p"'’ < 7 "'’] [p*"*" q*"^] and (p, q) be the solution 

of (l36l l. Then, by Theorem Owe have, 

[p^ eV (37) 

Since (p*, q*) G jl, by (iJTl l and Definition 0 


[P^ q^] 


T 



By the feasibility constraint in (I26bl i. 


(38) 


E E sVi’ 


where Bf^ are the energy band indices as explained in Section 

Applying (l39l l for i = 1, and remembering that Bq — 0, we 
get 

Bi Bi K 

E'?j = EE9*^*.^-(40) 

7=1 7=1 i=l 

By (PI) and (P2) in Section HV-AI q* = + Li, where 

L, = 0 V7 G [1 : Bi ], 

, (41) 

Li > 0 V7 G [Bi + 1 : AT] . 

From (l40l l and (|4TI) 

Bi K Bi K 

EE 9(1) + E E LA,j<Vi. (42) 

j=l i=l j=l i=Bi-|-l 

Using the fact that D is doubly stochastic and by (PI), 
Biy^i) = Vi, and we have 

Bi K 

^ ^ L,d,,, < 0. (43) 

7=1 i=Bi + l 

From (HTt and ( |4^ . we get 

d,., =0, V7 G [Bi + 1 : B] , Vj G [1 : Bi] . (44) 

As D is doubly stochastic, using (PI) and (l44l i. 

Bi Bi 

7^ = E 9(1) E = 9(1) = 9 ;, Vj G [1 : Bi]. (45) 

Since D is doubly stochastic, using (l44l t, we get 

Bi K Bi 

2 d.,, = Bi, 2 d,,, = 1, Vj G [1 : Bi]. (46) 

2=1 j—1 i—1 

We can rewrite (1461 ) as 

Bi K Bi Bi Bi K 

E E = E E +E E ^*’1’ (47) 

i=l 7=1 i=l 7=1 i=l j=Bi+l 

from which it follows that 

Bi K 

E E (48) 

i=i7=Si+l 

and hence, 

d,,, =0, V^G[l:Bl],VjG[Bl + l:iT]. (49) 

Then applying (IWb for i = 2, 

B 2 B 2 K 

E 9i= E T.dtd^,,<V2. (50) 

7=Bi-|-l 7=Bi + l i=l 

By (PI) and (P2), we have q* = q*^ + Li, where 

L, < 0 V7 G [1 : Bi], 

L, = 0 V7 G [Bi + 1 : B 2 ] , 

L, > 0 V7 G [B 2 + 1 : AT] . 


(39) 


(51) 
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From ( l50b and ( ISTT ). 

B2 K B2 K 

E + E E9(2)4j <^2 . (52) 

2=1 j=B\-\-l i—1 


Since the objective function is concave, by Jensen’s inequality, 
the new policy strictly increases the objective. Finally consid¬ 
ering the case where pk < Pk+i, Qk > Qk+h we can construct 
another feasible policy. 


Since D is doubly stochastic, by (PI), we obtain 
{B 2 — = ^ 2 , and using (|49] | and (ISTl l in (l52l l. we 

get 

B 2 K 

E E Lidij <Q,Li> Q. (53) 

j=Bi+li=B2-l-l 

From (fSTI) and (l53]) it can be concluded that 

=0, Vi e [B 2 -f 1 : Ff], Vj G [Bi -f 1 : B 2 ]. (54) 

As D is doubly stochastic, using (PI) together with ( l49l l and 
dH, we have 

B 2 

9j =9(2) E =9(2) =9;,Vj G [Bi + 1 :i? 2 ]. (55) 
Again, since D is doubly stochastic, using ( l49l l and ( l54l i. 



B 2 

K 






E E 

= B 2 — Bi, 





i=Bi + li 

B 2 

i=i 




(56) 


E ' 

P — ^ 

— J- 

, Vj G [Si + 1 

■-B 2 ]- 




i=Bi + l 






We can rewrite (l56ll 

as 





B 2 K 


B 2 

B 2 

B 2 

K 


E E 

di+ — 

E 

E + 

E 

E 

dij- 

i=Bi+l j=l 

i 

=Bi + lj. 

=Bi + l i= 


= B2+1 








(57) 

From (I 57 I 1 we can see that 







B 2 

K 






E 

E ^41 = 0, 



(58) 


i- 

=Bi + lj-- 

=B2-I-1 





and hence. 


d,^j = 0, V^ G [Bi + 1 : B 2 ] and Vj G [B 2 -f 1 : AT]. (59) 

Continuing this approach for i = 3, (|§r| — 1), we get 
q — q* ■ Since the EH profiles are similar, i.e., S^. = §t, 
replacing q hy p and ej by e‘/T in the above proof, we 
reach the similar conclusion for p, i.e., p = p*. Therefore, 


E. Proof of Lemma |2] 


Assume that at least one of the pk, qk is not monotonically 
increasing in k. Without loss of generality (w.l.o.s) we consider 
the cases in which pk > Pk+i, Qk > dk+i and pk < Pk+i, Qk > 
Qk+i- In the case of pk > Pk+i, Qk > Qk+i, we can construct 
a new feasible policy. 


Pk — Pk-\-l 
Qk = qk-\-l 


Pk + Pk+1 


Qk + Qk+l 


( 60 ) 


Pk —Pfei Pk+1 — Pk+l^ /-cix 

(61) 

qk = qk+i, qk +1 = qk- 

The function with variables p, q, r can be written as. 


R-^{p,q,T)=t\og, 





(62) 


where f = M — {M — 1) (l -f -^) , t = 1 — ^ and 

0 < T < T. The second order partial derivative of {p, q, r) 
is given by, 

=_S _ (63) 

dpdq t{l + f/t + pf/f^f 

Since / is monotonic in q, dhll) is positive. As > 0, by 

the definition of derivative, 

i?"'’ (p, q, t) + i?”'’ {p + 5,q + a,T) > 

{p + 5, q, t) -f (p, q + a,T) , a > 0. 

(64) 

Since (l64l l holds for all 0 < r < T, we have 

A”'' (p, q) + it'" {q + 5,q + a) > 

i?"'(p + <5,g)+A“^p,q + a), 

where is obtained by, 

^ jiub ^ (gg) 

r 

Finally, using dSB and (l65l l we can see that the newly 
constructed policy strictly increases the objective. 


F. Proof of Lemma |5] 

Let us assume that the transmission rates in the F-th 
and the k + 1 -th intervals are different, i.e., {Pk,qk) 7 ^ 
{pk+i,qk+i)- Before the F-fl-th interval, the energy in the 
buffers of TX and the RX are Ar = 9* 

At = Y1 ^=iPb respectively. W.l.o.s, we assume 

that Ar < At- We can construct another feasible policy 

Pk = Pk + S, Pk+1 = Pk+1 - S, 

, X ~ X 

9 fc = 9 fe + 0 , qk+l = qk+l — 0 , 

where S is chosen such that 6 < Ar and qk < (jk+i- Now, 
(l67l) can be written as 

Pk = apk + (1 - cfjpk+i, Pk+1 = {I-a) pk + apk+ij 

qk = aqk + (1 - a) qk+l, dk+i = {l-a)qk + aqk+i, 

(68) 

where a = 1 — 6/ {qk+i — qk)- Using Jensen’s inequality 

k+l k+1 

E^“'fe,g,)>E^“'fe7 9,), (69) 

j=k j=k 

which concludes the proof. 


2 
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